Least information document representation for automated text classification
نویسندگان
چکیده
منابع مشابه
Automated Text Classification for Fast Feedback - Investigating the Effects of Document Representation
New trends such as increased product complexity, changing customer requirements and shortening development time, have given rise to an increase in the number of unexpected events within the Product Development Process (PDP). Traditional tools are only partially adequate (either insufficient coverage or simply too late) to cover these unexpected events. As such, new tools are being sought to com...
متن کاملDocument Vector Space Representation Model for Automatic Text Classification
Classification of text documents presents a unique challenge to conventional classification algorithms. Due to the existence of large number of features in the datasets, providing a desired representation for text documents can be seen as another problem. In this paper a simple but effective representation model for text documents to tackle the classification problem is discussed. Two different...
متن کاملEnhanced Information Retrieval from Narrative German-language Clinical Text Documents using Automated Document Classification
The amount of narrative clinical text documents stored in Electronic Patient Records (EPR) of Hospital Information Systems is increasing. Physicians spend a lot of time finding relevant patient-related information for medical decision making in these clinical text documents. Thus, efficient and topical retrieval of relevant patient-related information is an important task in an EPR system. This...
متن کاملA Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملDistributional Semantic Representation for Text Classification and Information Retrieval
The objective of this experiment is to validate the performance of the distributional semantic representation of text in the classification (Question Classification) task and the Information Retrieval task. Followed by the distributional representation, first level classification of the questions is performed and relevant tweets with respect to the given queries are retrieved. The distributiona...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the American Society for Information Science and Technology
سال: 2012
ISSN: 0044-7870
DOI: 10.1002/meet.14504901118